AITopics | input node

Collaborating Authors

input node

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

New Complexity-Theoretic Frontiers of Tractability for Neural Network Training

Neural Information Processing SystemsFeb-16-2026, 14:16:52 GMT

A neural network (cf. Figure 1) can be thought of as a directed acyclic network consisting of

architecture, artificial intelligence, machine learning, (17 more...)

Neural Information Processing Systems

Country:

Europe > Austria > Vienna (0.14)
North America > United States > Maryland > Baltimore (0.04)
North America > United States > California > San Diego County > San Diego (0.04)
(9 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.93)

Add feedback

beed13602b9b0e6ecb5b568ff5058f07-AuthorFeedback.pdf

Neural Information Processing SystemsFeb-13-2026, 22:00:39 GMT

architecture, optimization, search space, (14 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (0.36)

Add feedback

Neural Sculpting: Uncovering hierarchically modular task structure in neural networks through pruning and network analysis

Neural Information Processing SystemsFeb-10-2026, 13:44:52 GMT

The high-level question in this work is: If we learn a task using a sufficiently deep NN, how can we uncover the underlying hierarchical organization of sub-functions in that task?

artificial intelligence, function graph, machine learning, (16 more...)

Neural Information Processing Systems

Country:

North America > United States (0.28)
Europe > Middle East > Cyprus > Nicosia > Nicosia (0.04)
Asia > India (0.04)

Genre: Research Report > New Finding (0.92)

Industry:

Health & Medicine (0.46)
Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.69)

Add feedback

Optimization and Regularization Under Arbitrary Objectives

Lakhani, Jared N., Pienaar, Etienne

arXiv.org Machine LearningNov-26-2025

This study investigates the limitations of applying Markov Chain Monte Carlo (MCMC) methods to arbitrary objective functions, focusing on a two-block MCMC framework which alternates between Metropolis-Hastings and Gibbs sampling. While such approaches are often considered advantageous for enabling data-driven regularization, we show that their performance critically depends on the sharpness of the employed likelihood form. By introducing a sharpness parameter and exploring alternative likelihood formulations proportional to the target objective function, we demonstrate how likelihood curvature governs both in-sample performance and the degree of regularization inferred by the training data. Empirical applications are conducted on reinforcement learning tasks: including a navigation problem and the game of tic-tac-toe. The study concludes with a separate analysis examining the implications of extreme likelihood sharpness on arbitrary objective functions stemming from the classic game of blackjack, where the first block of the two-block MCMC framework is replaced with an iterative optimization step. The resulting hybrid approach achieves performance nearly identical to the original MCMC framework, indicating that excessive likelihood sharpness effectively collapses posterior mass onto a single dominant mode.

likelihood, posterior, regularization, (15 more...)

arXiv.org Machine Learning

2511.19628

Country: Africa > South Africa > Western Cape > Cape Town (0.04)

Genre: Research Report (0.64)

Industry: Leisure & Entertainment > Games > Tic-Tac-Toe (0.54)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.87)

Add feedback

Graph Neural Network-Based Reinforcement Learning for Controlling Biological Networks - the GATTACA Framework

Mizera, Andrzej, Zarzycki, Jakub

arXiv.org Artificial IntelligenceNov-18-2025

Cellular reprogramming, the artificial transformation of one cell type into another, has been attracting increasing research attention due to its therapeutic potential for complex diseases. However, identifying effective reprogramming strategies through classical wet-lab experiments is hindered by lengthy time commitments and high costs. In this study, we explore the use of deep reinforcement learning (DRL) to control Boolean network models of complex biological systems, such as gene regulatory and signalling pathway networks. We formulate a novel control problem for Boolean network models under the asynchronous update mode, specifically in the context of cellular reprogramming. To solve it, we devise GATTACA, a scalable computational framework. To facilitate scalability of our framework, we consider previously introduced concept of a pseudo-attractor and improve the procedure for effective identification of pseudo-attractor states. We then incorporate graph neural networks with graph convolution operations into the artificial neural network approximator of the DRL agent's action-value function. This allows us to leverage the available knowledge on the structure of a biological system and to indirectly, yet effectively, encode the system's modelled dynamics into a latent representation. Experiments on several large-scale, real-world biological networks from the literature demonstrate the scalability and effectiveness of our approach.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

arXiv.org Artificial Intelligence

2505.02712

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

New Complexity-Theoretic Frontiers of Tractability for Neural Network Training

Neural Information Processing SystemsOct-9-2025, 04:58:36 GMT

A neural network (cf. Figure 1) can be thought of as a directed acyclic network consisting of

architecture, artificial intelligence, machine learning, (18 more...)

Neural Information Processing Systems

Country:

Europe > Austria > Vienna (0.14)
North America > United States > Maryland > Baltimore (0.04)
North America > United States > California > San Diego County > San Diego (0.04)
(9 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.93)

Add feedback

3b1675de6b49cc00084374213f8c38ae-Paper-Conference.pdf

Neural Information Processing SystemsOct-8-2025, 12:12:36 GMT

function graph, hierarchical structure, visualization, (14 more...)

Neural Information Processing Systems

Country:

North America > United States (0.28)
Europe > Middle East > Cyprus > Nicosia > Nicosia (0.04)
Asia > India (0.04)

Genre: Research Report > New Finding (0.92)

Industry:

Health & Medicine (0.46)
Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.69)

Add feedback

Rethinking Probabilistic Circuit Parameter Learning

Liu, Anji, Shao, Zilei, Broeck, Guy Van den

arXiv.org Artificial IntelligenceOct-7-2025

Probabilistic Circuits (PCs) offer a computationally scalable framework for generative modeling, supporting exact and efficient inference of a wide range of probabilistic queries. While recent advances have significantly improved the expressiveness and scalability of PCs, effectively training their parameters remains a challenge. In particular, a widely used optimization method, full-batch Expectation-Maximization (EM), requires processing the entire dataset before performing a single update, making it ineffective for large datasets. Although empirical extensions to the mini-batch setting, as well as gradient-based mini-batch algorithms, converge faster than full-batch EM, they generally underperform in terms of final likelihood. We investigate this gap by establishing a novel theoretical connection between these practical algorithms and the general EM objective. Our analysis reveals a fundamental issue that existing mini-batch EM and gradient-based methods fail to properly regularize distribution changes, causing each update to effectively ``overfit'' the current mini-batch. Motivated by this insight, we introduce anemone, a new mini-batch EM algorithm for PCs. Anemone applies an implicit adaptive learning rate to each parameter, scaled by how much it contributes to the likelihood of the current batch. Across extensive experiments on language, image, and DNA datasets, anemone consistently outperforms existing optimizers in both convergence speed and final performance.

artificial intelligence, machine learning, node, (15 more...)

arXiv.org Artificial Intelligence

2505.19982

Country:

North America > United States > California (0.28)
Europe (0.28)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.66)

Add feedback

beed13602b9b0e6ecb5b568ff5058f07-AuthorFeedback.pdf

Neural Information Processing SystemsAug-22-2025, 02:13:57 GMT

Thanks for the comments and we will reorganize the paper according to your suggestions. R1 may think NA T as a NAS method. How to get skip connections in VGG? Then, NA T can add skip connections into VGG by replacing the null connections (see more discussions in Section 4.5). Why the generated networks have two inputs "-2" and "-1": "-1" represent the outputs of the second nearest and the most nearest cell in front of the current one, respectively.

architecture, optimization, search space, (14 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (0.36)

Add feedback

Filters

Collaborating Authors

input node

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

New Complexity-Theoretic Frontiers of Tractability for Neural Network Training

beed13602b9b0e6ecb5b568ff5058f07-AuthorFeedback.pdf

Neural Sculpting: Uncovering hierarchically modular task structure in neural networks through pruning and network analysis

b1adda14824f50ef24ff1c05bb66faf3-Supplemental.pdf

Optimization and Regularization Under Arbitrary Objectives

Graph Neural Network-Based Reinforcement Learning for Controlling Biological Networks - the GATTACA Framework

New Complexity-Theoretic Frontiers of Tractability for Neural Network Training

3b1675de6b49cc00084374213f8c38ae-Paper-Conference.pdf

Rethinking Probabilistic Circuit Parameter Learning

beed13602b9b0e6ecb5b568ff5058f07-AuthorFeedback.pdf